PROGRAM FILES:

The file "main_data.m" is the MATLAB program to extract and arrange the data.
The file "rnd_regression.do" is the Stata program that generates the results reported in Tables 2-6 in the main text, and Tables J.3-J.9 in the online appendix.
The file "main_sampling.m" is the MATLAB program that generates the results reported in Table J.10 in the online appendix.
The file "compute_homogeneous_subsidy.m" is the MATLAB program that computes the homogeneous subsidy levels shown in Figure 6 (top panels) in the main text.
The file "compute_targeted_subsidy.m" is the MATLAB program that computes the targeted subsidy levels shown in Figures 6 (bottom panels), Figure 7, Tables 7 and Table 8 in the main text.

DATA:

We make use of data on R&D collaborations between private and publicly listed companies that is part of the Thomson and Reuters Securities Data Company (SDC) database. As this database is proprietary, we cannot make it publicly available. However, it can be accessed by any researcher through a subscription to Thomson and Reuters, and many universities actually have subscriptions to this database for their students, faculty and staff. More information about this database can be found below:

https://financial.thomsonreuters.com/en/products/data-analytics/market-data/sdc-platinum-financial-securities.html

Similarly, we make use of the Cooperative Agreements and Technology Indicators (CATI) database. For more information about the CATI database, or how to obtain it, below you will find the contact details of the database administrator :

Marc van Ekert
CATI Database Administrator
Telephone +(31)43-3883726, Secretary +(31)43-3883823, Fax +(31)43-3884893
E-mail : m.vanekert@os.unimaas.nl

Postal address: 
Marc van Ekert
Department of Organization and Strategy
Faculty of Economics and Business Administration
P.O. Box 616
6200 MD Maastricht
The Netherlands.
